IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



U.S. National Phase Patent Application Based On 
PCT/US99/29213 

Serial No. : To be assigned 

Filing Date: Herewith 

For: PRODUCTION OF RECOMBINANT 
MONELLIN USING 
METHYLOTROPHIC YEAST 
EXPRESSION SYSTEM 



Assistant Commissioner for Patents 
Washington, D.C. 20231 

Dear Sir: 

Preliminary to the examination of the above-captioned application, please amend the 
application as follows. 

IN THE SPECIFICATION: 

Please replace the paragraph beginning at page 5, line 23, with the following rewritten 
paragraph: 




Rhea Amid 



In the application of: 



Examiner: To be assigned 



Lingxun DUAN 



Group Art Unit: To be assigned 



PRELIMINARY AMENDMENT 



sd-45643 



-- FIG. 1 shows the amino acid sequence of a recombinant single-chain monellin protein 
(SEQ ID NO:5) and the DNA sequence encoding the recombinant single-chain monellin protein 
(SEQ ID NO:6). Amino acid residues 1-50 correspond to the amino acid residues 1-50 of the B 
chain of native monellin; amino acid residue 51 is Glycine as the linker; and amino acid residues 
52-96 correspond to the amino acid residues 1-45 of the A chain of native monellin. -- 

Please replace the paragraph beginning at page 5, line 29, with the following rewritten 
paragraph: 

-- FIG. 2 shows the DNA sequences of the oligos which were used for synthesis of the 
recombinant single-chain monellin protein (SEQ ID NOs:7-14). -- 

IN THE FIGURES: 

Please replace figures 1 and 2 with substitute figures 1 and 2. 
IN THE ABSTRACT: 

After the Drawings, please add the following Abstract: 

--The present invention relates to a single-chain monellin-like protein which is stable and 
which is at least 100-fold sweet as compared to sucrose on the weight basis. The present 
invention also relates to a nucleic acid encoding said monellin-like protein. Preferably, the 
nucleic acid further comprises a promoter and a signal sequence for directing expression and 
secretion of the encoded monellin-like protein in the methylotrophic yeast Pichia pastoris. The 
present invention further relates to a recombinant Pichia pastoris cell containing the nucleic acid 
encoding the monellin-like protein, a process for producing the monellin-like protein from the 
recombinant Pichia pastoris and product of the process. -- 
REMARKS 



Please insert the attached paper copy of the Sequence Listing in the above-captioned 
patent application. Computer readable copy of the Sequence Listing (CRF copy) accompanies 
this Amendment. 

The undersigned hereby states that the computer readable form copy (CRF copy) of the 
Sequence Listing and the paper copy of the Sequence Listing, submitted in accordance with 37 
C.F.R. § 1.825(a) and (b), respectively, are the same and contain no new matter. Accordingly, 
entry of the Sequence Listing into the above-captioned case is respectfully requested. 

Also attached hereto is a marked-up version of the changes made to the specification by 
the current amendment. The attached page is captioned "Version with markings to show 
changes made." 
Serial No. To be assigned 
Docket No. 464332000200 



In the unlikely event that the transmittal letter is separated from this document and the 
Patent Office determines that an extension and/or other relief is required, applicant petitions for 
any required relief including extensions of time and authorizes the Assistant Commissioner to 
charge the cost of such petitions and/or other fees due in connection with the filing of this 
document to Deposit Account No. 03-1952 referencing docket no. 464332000200. However, 
the Assistant Commissioner is not authorized to charge the cost of the issue fee to the Deposit 
Account. 



Respectfully submitted, 



Dated: 



June 26, 2001 




Peng Chen 

Registration No. 43,543 



Morrison & Foerster LLP 
3811 Valley Centre Drive, Suite 500 



San Diego, CA 92130-2332 
Telephone: (858) 720-5117 
Facsimile: (858) 720-5125 






VERSION WITH MARKINGS TO SHOW CHANGES MADE 



In the Specification: 

On page 5, line 24, after "protein" insert --(SEQ ID NO:5)--; 
On page 5, line 25, after "protein" insert --(SEQ ID NO:6)--; 
On page 5, line 30, after "protein" insert --(SEQ ID NOs:7-14)--. 
GGT GAG TGG GAG ATT ATT GAC ATT GGT CCA TTC ACT 
Gly Glu Trp Glu Ile Ile Asp Ile Gly Pro Phe Thr 

CAA AAC TTG GGT AAG TTC GCT GTT GAC GAG GAG AAC 
Gln Asn Leu Gly Lys Phe Ala Val Asp Glu Glu Asn 

AAG ATT GGT CAA TAC GGT AGA TTG ACT TTC AAC AAG 
Lys Ile Gly Gln Tyr Gly Arg Leu Thr Phe Asn Lys 

GTT ATT AGA CCA TGT ATG AAG AAG ACT ATT TAC GAG 
Val Ile Arg Pro Cys Met Lys Lys Thr Ile Tyr Glu 

AAC GAG GGT TCT AGA GAG ATT AAG GGT TAC GAG TAC 
Asn Glu Gly Ser Arg Glu Ile Lys Gly Tyr Glu Tyr 

CAA TTG TAC GTT TAC GCT TCT GAC AAG TTG TTC CGT 
Gln Leu Tyr Val Tyr Ala Ser Asp Lys Leu Phe Arg 

GCT GAC ATT TCT GAG GAC TAC AAG ACT CGT GGT CGT 
Ala Asp Ile Ser Glu Asp Tyr Lys Thr Arg Gly Arg 

AAG TTG TTG AGA TTC AAC GGT CCA GTT CCA CCA CCA 
Lys Leu Leu Arg Phe Asn Gly Pro Val Pro Pro Pro 

TAA (SEQ ID NO:6) 
stop (SEQ ID NO:5) 



FIG. 1 
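
The reading frame and the segment boundaries described for FIG. 1 (residues 1-50, linker residue 51, residues 52-96) can be checked mechanically. The following Python sketch is offered only as an illustration. The sequence is transcribed from FIG. 1, with three apparent scanning errors corrected as assumptions (TIC read as TTC, TAG read as TAC where the figure labels Tyr, and GAG read as GAC at the codon labelled Asp); the names `FIG1_ROWS` and `translate` are hypothetical.

```python
# Coding sequence transcribed from FIG. 1, one figure row per entry.
# Assumed scan corrections: TIC -> TTC (Phe), TAG -> TAC (Tyr),
# and GAG -> GAC at codon 7 (labelled Asp in the figure).
FIG1_ROWS = [
    "GGT GAG TGG GAG ATT ATT GAC ATT GGT CCA TTC ACT",
    "CAA AAC TTG GGT AAG TTC GCT GTT GAC GAG GAG AAC",
    "AAG ATT GGT CAA TAC GGT AGA TTG ACT TTC AAC AAG",
    "GTT ATT AGA CCA TGT ATG AAG AAG ACT ATT TAC GAG",
    "AAC GAG GGT TCT AGA GAG ATT AAG GGT TAC GAG TAC",
    "CAA TTG TAC GTT TAC GCT TCT GAC AAG TTG TTC CGT",
    "GCT GAC ATT TCT GAG GAC TAC AAG ACT CGT GGT CGT",
    "AAG TTG TTG AGA TTC AAC GGT CCA GTT CCA CCA CCA",
    "TAA",
]

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(rows):
    """Translate space-separated codons, stopping at the first stop codon."""
    protein = []
    for codon in " ".join(rows).split():
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


protein = translate(FIG1_ROWS)
b_part, linker, a_part = protein[:50], protein[50], protein[51:]
print(len(protein), len(b_part), linker, len(a_part))  # 96 50 G 45
```

Under these assumptions the translation gives 96 residues with glycine at position 51, which agrees with the description of FIG. 1.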
M1 

5' AGA ATT CGG TGA GTG GGA GAT TAT TGA CAT TGG TCC ATT 
CAC TCA AAACTTGG 3' (SEQ ID NO:7) 

M2 

5' GAA CAA GAT TGG TCA ATA CGG TAG ATT GAC TTT CAA CAA 
GTT TAT TAG GCC ATG T 3' (SEQ ID NO:8) 

M3 

5' GAG ACC GAG GGT TCT AGA GAG ATT AAG GGT TAG GAG TAG 
[remainder of sequence illegible in source] (SEQ ID NO:9) 

M4 

5' GTG CTG ACA TTC CTG AGG ACT ACA AGA CTC GTG GTC GTA 
AGT TGT TGA GAT TC 3' (SEQ ID NO:10) 

N1 

5' GTA TTG ACC AAT CTT GTT CTC CTC GTC AAC AGC GAA CTT 
ACC CAA GTT TTG AGT GAA TG 3' (SEQ ID NO:11) 

N2 

5' CTCTAGAACCCTCGTTCTCGTAAATAGTCTTCTTCATAC 
ATGGTCTAATAACCTTG 3' (SEQ ID NO:12) 

N3 

5' GTC CTC AGA AAT GTC AGC ACG GAA CAA CTT GTC AGA AGC 
GTA AAC GTA CAA TTG (SEQ ID NO:13) 

N4 

5' AGA ATT CTT ATG GTG GTG GAA CTG GAC CGT TGA ATC TCA 
ACA ACT TAG GAC 3' (SEQ ID NO:14) 



FIG. 2 
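
The M oligos read in the coding direction and the N oligos are their antisense partners, so the reverse complement of an N oligo should appear in the coding strand of FIG. 1. The Python sketch below illustrates that check for oligo N2 only, under stated assumptions: N2 is taken as printed in FIG. 2, the coding fragment is rows 3 to 5 of FIG. 1 with TAG read as TAC (an assumed scan correction), and the names `reverse_complement` and `CODING_FRAGMENT` are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Oligo N2 as printed in FIG. 2 (5' to 3'), line break removed.
N2 = "CTCTAGAACCCTCGTTCTCGTAAATAGTCTTCTTCATACATGGTCTAATAACCTTG"

# Rows 3-5 of the FIG. 1 coding strand (codons 25-60), spaces removed.
# TAG in the scan is read as TAC (Tyr); this is an assumption.
CODING_FRAGMENT = (
    "AAGATTGGTCAATACGGTAGATTGACTTTCAACAAG"
    "GTTATTAGACCATGTATGAAGAAGACTATTTACGAG"
    "AACGAGGGTTCTAGAGAGATTAAGGGTTACGAGTAC"
)

n2_as_coding = reverse_complement(N2)
offset = CODING_FRAGMENT.find(n2_as_coding)  # -1 would mean "not found"
print(n2_as_coding)
print(offset)
```

A non-negative offset means the antisense oligo pairs with that stretch of the coding strand; here N2 spans the end of row 3 through the TCT AGA codons of row 5.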
PRODUCTION OF RECOMBINANT MONELLIN USING 
METHYLOTROPHIC YEAST EXPRESSION SYSTEM 

This application claims the benefit of priority under 35 U.S.C. §119(e) to 
U.S. provisional application Serial No. 60/114,529 to Lingxun Duan, filed 
December 31, 1998, and entitled PRODUCTION OF RECOMBINANT 
MONELLIN USING METHYLOTROPHIC YEAST EXPRESSION SYSTEM. 



1. FIELD OF THE INVENTION 

10 The present invention relates to a single-chain monellin-like protein which is 

stable and which is at least 100-fold sweet as compared to sucrose on the weight 
basis. The present invention also relates to a nucleic acid encoding said monellin- 
like protein. Preferably, the nucleic acid further comprises a promoter and a signal 
sequence for directing expression and secretion of the encoded monellin-like 

15 protein in the methylotrophic yeast Pichia pastoris. The present invention further 
relates to a recombinant Pichia pastoris cell containing the nucleic acid encoding 
the monellin-like protein, a process for producing the monellin-like protein from 
the recombinant Pichia pastoris and product of the process. 



20 2. BACKGROUND ART 

2.1. MONELLIN 

Monellin belongs to a family of intensely sweet proteins derived from 
tropical plants (Dansby, Nature Biotechnology, 1997, 15:419-420). Monellin is 
about 3,000-fold sweet as compared to sucrose. Other similar proteins include 

25 thaumatin, miraculin, mabinlin, pentadin and aspartame (Id.). Monellin was first 
isolated from the West African plant Dioscoreophyllum cumminsii (U.S. Patent 
Nos. 3,878,184 and 3,998,798; Morris and Cagan, Biochim. Biophys. Acta, 1972, 
261:114-122). The amino acid sequence, the three-dimensional structure and 
various physical and chemical properties of monellin have been characterized 

30 (Ogata et al., Nature, 1987, 328:739-742; Morris et al., J. Biol. Chem., 1973, 
248:534-539; Cagan, Science, 1973, 181:32-35; Bohak and Li, Biochim. Biophys. 
Acta, 1976, 427:153-170; Hudson and Beeman, Biochem. Biophys. Res. Comm., 
1976, 21:212-220; Van der Wel and Loeve, FEBS Lett., 1973, 29:181-183; and 
Frank and Zuber, Hoppe-Seyler's Z. Physiol. Chem., 1976, 357:585-592). 
U.S. Patent No. 4,300,576 discloses smoking articles containing thaumatin 
or monellin. U.S. Patent No. 4,562,076 discloses chewing gum with coating of 
thaumatin or monellin. U.S. Patent No. 4,412,984 discloses flavor potentiated oral 
compositions containing thaumatin or monellin. However, despite its potential as 
5 low-calorie sweeteners, wide commercial application of monellin is hampered by 
concerns over its poor stability to heat and pH, lack of access to sources of supply 
of the plant and uncertainty in the regulatory climate for food additives (Dansby, 
Nature Biotechnology, 1997, 15:419-420). 

In 1989, Sung-Hou Kim's group reported production of single-chain 
10 monellin in E. coli by genetic engineering (Kim et al., Protein Eng., 1989, 2:571- 
575). The purified single-chain monellin was found to be more heat-stable and 
tolerant to a wide pH range, but retained the intensity of sweetness. Several 
aspects of this invention have been the subject of certain U.S. patents. For 
example, U.S. Patent No. 5,234,834 discloses constructs for expression of single- 

15 chain monellin in plant cells. U.S. Patent No. 5,487,923 discloses a sweet 

proteinaceous compound of the formula B-C-A, wherein B represents a peptide 
portion at least 90% homologous to residues 1-46 of the B chain of native monellin 
and modified only by conservative substitutions; C is a covalent bond or is a 
hydrophilic, physiologically acceptable covalent linker capable of providing a 

20 spacing length equivalent to a peptide of 1-10 amino acids selected so as to reside 
on the external portion of the molecule and not to disturb the native conformation; 
and A represents a peptide at least 90% homologous to residues 6-45 of the A 
chain of native monellin and modified only by conservative substitution. U.S. 
Patent No. 5,487,983 discloses an expression system for making the single-chain 

25 monellin disclosed in U.S. Patent No. 5,487,923. U.S. Patent No. 5,670,339 
discloses DNA encoding the single-chain monellin disclosed in U.S. Patent No. 
5,487,923. U.S. Patent No. 5,672,372 discloses methods for sweetening a food 
composition with the single-chain monellin disclosed in U.S. Patent No. 5,487,923. 
U.S. Patent No. 5,264,558 discloses a single-chain monellin protein that is, in a 

30 standard taste test, at least 50 times that of sucrose on a weight basis. 

Recently, Kondo et al., Nature Biotechnology, 1997, 15:453-457 discloses 
heterologous expression of a single-chain monellin protein in the yeast Candida 
utilis intracellularly. It reports that monellin was produced at a high level, 
accounting for >50% of the soluble protein. 
2.2. EXPRESSION OF HETEROLOGOUS PROTEINS 
IN PICHIA PASTORIS 

The methylotrophic yeast Pichia pastoris has been used as a protein 
5 expression system. Several aspects of this expression system have been the subject 
of certain U.S. patents. For example, U.S. Patent No. 4,837,148 discloses 
autonomous replication sequences for Pichia pastoris. U.S. Patent No. 4,855,231 
discloses regulatory region for heterologous gene expression in Pichia pastoris 
cells. U.S. Patent No. 4,882,279 discloses site selective genomic modification of 
10 Pichia pastoris. U.S. Patent No. 4,929,555 discloses a method for making whole 
cells Pichia pastoris competent for transformation. U.S. Patent No. 5,122,465 
discloses a process for generating a selectable phenotype in strains of Pichia 
pastoris. U.S. Patent No. 5,324,639 discloses production of insulin-like growth 
factor-1 in methylotrophic cells, including Pichia pastoris cells. 
15 A number of signal sequences have been used to direct secretion of 

heterologous proteins expressed in Pichia pastoris cells. Examples of such signal 
sequences include, but are not limited to, the signal sequence of Pichia pastoris 
acid phosphatase, the signal sequence of Aspergillus giganteus alpha-Sarcin 
(Martinez-Ruiz et al., Protein Expr. Purif., 1998, 12(3):315-22; Abdulaev et al., 
20 Protein Expr. Purif., 1997, 10(1):61-9; Kotake et al., J. Lipid Res., 1996, 
37(3):599-605), the signal sequence of alpha-N-Acetylgalactosaminidase 
(alphaNAGAL, EC 3.2.1.49) (Zhu et al., Arch. Biochem. Biophys., 1998, 
352(1):1-8), the signal peptide of the OmpA protein (Heim et al., Biochim. 
Biophys. Acta., 1998, 1396(3):306-19), the signal sequence of the mouse 
25 alpha-factor signal (cCell) or the native signal sequence of pepper 

endo-beta-1,4-glucanases (Ferrarese et al., FEBS Lett., 1998, 422(1):23-6), signal 
peptide of laccase isolated from the ligninolytic fungus Trametes (Jonsson et al., 
Curr. Genet., 1997, 32(6):425-30), signal peptide of murine lysosomal acid 
alpha-mannosidase (Merkle et al., Biochim. Biophys. Acta., 1997, 1336(2):132-46), 
30 signal peptide of the porcine inhibitor of carbonic anhydrase (Wuebbens et al., 
Biochemistry, 1997, 36(14):4327-36), signal sequence of Aspergillus awamori 
glucoamylase (Fierobe et al., Protein Expr. Purif., 1997, 9(2):159-70), signal 
sequence of mouse major urinary protein (Ferrari et al., FEBS Lett., 1997, 
401(1):73-7), signal sequence of pho1 (Skory et al., Curr. Genet., 1996, 



WO 00/40603 PCT/US99/29213 

4 

30(5):417-22), signal sequence of rabbit angiotensin-converting enzyme (ACE) 
(Sadhukhan et al., J. Biol. Chem., 1996, 271(30):18310-3), prepeptide sequence of 
Pichia pastoris aspartic proteinase (Tsujikawa et al., Yeast, 1996, 12(6):541-53), 
signal sequence of Pichia pastoris PRC1 (Ohi et al., Yeast, 1996, 12(1):31-40), the 
5 signal sequence of a bacterial thermostable alpha amylase and SUC2 gene signal 
sequence from Saccharomyces cerevisiae (Paifer et al., Yeast, 1994, 10(11):1415-9) 
and the signal sequence of Saccharomyces cerevisiae mating pheromone α-factor 
(Fidler et al., J. Mol. Endocrinol., 1998, 21(3):327-336). 

Although the methylotrophic yeast Pichia pastoris has been used 
10 successfully for the production of various heterologous proteins, U.S. Patent No. 
5,324,639 discloses that at the present level of understanding of methylotrophic 
yeast expression systems, it is unpredictable whether a given gene can be expressed 
to an appreciable level in such yeast or whether the yeast host will tolerate the 
presence of the recombinant gene product in its cells. U.S. Patent No. 5,324,639 
15 further discloses that it is especially difficult to foresee if a particular protein will 
be secreted by the methylotrophic yeast host, and if it is, at what efficiency. For 
example, Vollmer et al., J. Immunol. Methods, 1996, 199(1):47-54, reports that 
when the 323 amino acid residues of the human sIL-6R are inserted into an 
expression/secretion vector suitable for the methylotrophic yeast Pichia pastoris, no 
20 detectable expression and secretion of the recombinant protein was obtained. To 
date, monellin has not been expressed and secreted using the Pichia pastoris 
expression system. 

Given the great interest in the commercial application of monellin, there is a 
great need for a more efficient method for producing stable monellin which still 
25 retains its native sweet intensity and which simplifies downstream purification 
procedures. The present invention addresses these and other needs in the art. 
Citation of references hereinabove shall not be construed as an admission that such 
references are prior art to the present invention. 

30 3. SUMMARY OF THE INVENTION 

The present invention relates to an isolated nucleic acid comprising a 
nucleotide sequence encoding a chimeric protein, said chimeric protein comprises, 
from N-terminus to C-terminus: a) a first peptidyl fragment consisting of an amino 
acid sequence that has at least 40% identity to residues 1-50 of the B chain of 
native monellin, in which the percentage identity is determined over an amino acid 
sequence of identical size to the B chain of native monellin; b) a peptidyl bond, or 
a second peptidyl fragment consisting of 1-12 amino acids; and c) a third peptidyl 
fragment consisting of an amino acid sequence that has at least 40% identity to 
5 residues 1-45 of the A chain of native monellin, in which the percentage identity is 
determined over an amino acid sequence of identical size to the A chain of native 
monellin, wherein said chimeric protein is stable and a given amount of said 
chimeric protein is at least 100-fold sweet as compared to the identical amount of 
sucrose, and within said nucleic acid, codons which are preferably used by yeast 

10 cells are used. Preferably, the isolated nucleic acid further encodes a promoter 

which is capable of directing protein expression in Pichia pastoris and/or an amino 
acid sequence which is capable of directing secretion of the encoded chimeric 
protein from Pichia pastoris. 

The present invention also relates to a recombinant Pichia pastoris cell 

15 containing the above nucleic acids. The present invention further relates to a 

process for producing a chimeric protein comprising growing a recombinant Pichia 
pastoris cell containing the above nucleic acid such that the encoded chimeric 
protein is expressed and secreted by the cell, and recovering the expressed and 
secreted chimeric protein. Finally, the present invention relates to products of the 

20 above processes. 

4. BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows the amino acid sequence of a recombinant single-chain 
monellin protein and the DNA sequence encoding the recombinant single-chain 
25 monellin protein. Amino acid residues 1-50 correspond to the amino acid residues 
1-50 of the B chain of native monellin; amino acid residue 51 is Glycine as the 
linker; and amino acid residues 52-96 correspond to the amino acid residues 1-45 
of the A chain of native monellin. 

FIG. 2 shows the DNA sequence of the oligos which were used for 
30 synthesis of the recombinant single-chain monellin protein. 

FIG. 3 shows the location of each of DNA oligo in the synthesized monellin 
DNA and its enzymatic digestion sites. 

FIG. 4 shows the restriction map of recombinant monellin protein expression 
vector pGWYSl. 
FIG. 5 shows the construction of pGWYSl. 

FIG. 6 shows the SDS-PAGE analysis of the secreted recombinant monellin 
protein isolated from the culture medium. 

FIG. 7 shows the steps for purifying the secreted recombinant monellin 

5 protein. 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a nucleic acid encoding a single-chain 
monellin-like protein which is stable and which is at least 100-fold sweet as 

10 compared to sucrose on the weight basis. Preferably, the nucleic acid further 

comprises a promoter and a signal sequence for directing expression and secretion 
of the encoded monellin-like protein in the methylotrophic yeast Pichia pastoris. 
The present invention also provides a recombinant Pichia pastoris cell containing 
the nucleic acid encoding the monellin-like protein, a process for producing the 

15 monellin-like protein from the recombinant Pichia pastoris and product of the 
process. 

For clarity of disclosure, and not by way of limitation, the detailed 
description of the invention is divided into the subsections which follow. 

20 5.1. NUCLEIC ACIDS ENCODING 

THE SINGLE-CHAIN MONELLIN PROTEINS 

The present invention provides an isolated nucleic acid comprising a 

nucleotide sequence encoding a chimeric protein, said chimeric protein comprises, 
from N-terminus to C-terminus: a) a first peptidyl fragment consisting of an amino 

25 acid sequence that has at least 40% identity to residues 1-50 of the B chain of 

native monellin, in which the percentage identity is determined over an amino acid 
sequence of identical size to the B chain of native monellin; b) a peptidyl bond, or 
a second peptidyl fragment consisting of 1-12 amino acids; and c) a third peptidyl 
fragment consisting of an amino acid sequence that has at least 40% identity to 

30 residues 1-45 of the A chain of native monellin, in which the percentage identity is 
determined over an amino acid sequence of identical size to the A chain of native 
monellin, wherein said chimeric protein is stable and a given amount of said 
chimeric protein is at least 100-fold sweet as compared to the identical amount of 






sucrose, and within said nucleic acid, codons which are preferably used by yeast 
cells are used. 
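
The percentage identity used above is determined over sequences of identical size. One plausible reading, assumed here, is a position-by-position comparison without gaps. The Python sketch below illustrates that reading only; the function name `percent_identity` and the five-substitution variant are hypothetical, and the reference fragment is the first 50 residues encoded in FIG. 1.

```python
def percent_identity(candidate, reference):
    """Percentage of positions at which two equal-length sequences match.

    Assumes an ungapped, position-by-position comparison, which is one
    reading of "determined over an amino acid sequence of identical size".
    """
    if len(candidate) != len(reference):
        raise ValueError("sequences must be the same length for this comparison")
    matches = sum(1 for c, r in zip(candidate, reference) if c == r)
    return 100.0 * matches / len(reference)


# First peptidyl fragment as encoded in FIG. 1 (residues 1-50), one-letter code.
B_FRAGMENT = "GEWEIIDIGPFTQNLGKFAVDEENKIGQYGRLTFNKVIRPCMKKTIYENE"

# Hypothetical variant: the first five residues replaced by alanine.
variant = "AAAAA" + B_FRAGMENT[5:]

print(percent_identity(B_FRAGMENT, B_FRAGMENT))  # 100.0
print(percent_identity(variant, B_FRAGMENT))     # 90.0
```

On this reading, the hypothetical variant would satisfy the 40%, 60% and 90% identity thresholds recited for the first peptidyl fragment.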

In a specific embodiment, the present invention provides an isolated nucleic 
acid comprising a nucleotide sequence encoding the chimeric protein wherein the 
5 first peptidyl fragment consists of an amino acid sequence that has at least 60% 
identity to the B chain of native monellin. Preferably, the first peptidyl fragment 
consists of an amino acid sequence that has at least 90% identity to the B chain of 
native monellin. More preferably, the first peptidyl fragment consists of the amino 
acid residues 1-50 of the B chain of native monellin. 

10 In another specific embodiment, the present invention provides an isolated 

nucleic acid comprising a nucleotide sequence encoding the chimeric protein 
wherein the second peptidyl fragment consists of the amino acid sequence Gly-Gly- 
Gly-Ser-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Ser (SEQ ID NO:1). Preferably, the second 
peptidyl fragment consists of the amino acid sequence Gly-Gly-Gly-Ser (SEQ ID 

15 NO:2). More preferably, the second peptidyl fragment consists of amino acid 
residue Gly. 

In still another specific embodiment, the present invention provides an 
isolated nucleic acid comprising a nucleotide sequence encoding the chimeric 
protein wherein the third peptidyl fragment consists of an amino acid sequence that 

20 has at least 60% identity to the A chain of native monellin. Preferably, the third 
peptidyl fragment consists of an amino acid sequence that has at least 90% identity 
to the A chain of native monellin. More preferably, the third peptidyl fragment 
consists of the amino acid residues 1-45 of the A chain of native monellin. 

In a preferred embodiment, the present invention provides an isolated 

25 nucleic acid comprising a nucleotide sequence encoding the chimeric protein 

wherein the first peptidyl fragment consists of the amino acid residues 1-50 of the 
B chain of native monellin, the second peptidyl fragment consists of the amino acid 
residue Gly and the third peptidyl fragment consists of the amino acid residues 1- 
45 of the A chain of native monellin. 

30 In a specific embodiment, the present invention provides an isolated nucleic 

acid comprising a nucleotide sequence encoding the chimeric protein which is 
capable of being immunoreactively bound by an anti-monellin or an anti-thaumatin 
antibody. 

In another specific embodiment, the present invention provides an isolated 
nucleic acid comprising a nucleotide sequence encoding the chimeric protein 
5 wherein the chimeric protein further comprises an amino acid sequence which is 
capable of directing secretion of said chimeric protein from Pichia pastoris. 
Preferably, the secretion-directing sequence is an endogenous signal sequence of 
Pichia pastoris. More preferably, the endogenous signal sequence is selected from 
the group consisting of the signal sequence of Pichia pastoris acid phosphatase, 
10 Pichia pastoris aspartic proteinase and Pichia pastoris carboxypeptidase Y encoded 
by Pichia pastoris PRC1. Alternatively, the secretion-directing sequence is a yeast 
signal sequence, wherein said yeast is not Pichia pastoris. Preferably, the yeast 
signal sequence is a signal sequence from Saccharomyces cerevisiae. More 
preferably, the Saccharomyces cerevisiae signal sequence is selected from the group 
15 consisting of the signal sequence of Saccharomyces cerevisiae SUC2 and 
Saccharomyces cerevisiae mating pheromone α-factor. Most preferably, the 
Saccharomyces cerevisiae signal sequence is the signal sequence of 
Saccharomyces cerevisiae mating pheromone α-factor. Examples of other 
secretion-directing sequences that can be used in the present invention include, but 
20 are not limited to, the signal sequence of Aspergillus giganteus alpha-Sarcin, 

alpha-N-Acetylgalactosaminidase, OmpA protein, the mouse alpha-factor (cCell), 
the pepper endo-beta-1,4-glucanases, the laccase isolated from the ligninolytic 
fungus Trametes, murine lysosomal acid alpha-mannosidase, the porcine inhibitor 
of carbonic anhydrase, Aspergillus awamori glucoamylase, mouse major urinary 
25 protein, pho1, rabbit angiotensin-converting enzyme (ACE), and the bacterial 
thermostable alpha amylase. 

In a specific embodiment, the present invention provides an isolated nucleic 
acid comprising a nucleotide sequence encoding the chimeric protein which nucleic 
acid is a DNA. In another specific embodiment, the present invention provides an 
30 isolated nucleic acid which is hybridizable to the DNA sequence encoding the 
chimeric protein. In still another specific embodiment, the present invention 
provides an isolated nucleic acid comprising a nucleotide sequence complementary 
to the nucleotide sequence encoding the chimeric protein. 

In a specific embodiment, the present invention provides a DNA encoding 
the chimeric protein which DNA further comprises a promoter which is capable of 
directing protein expression in Pichia pastoris. Preferably, the promoter is an 
endogenous promoter of Pichia pastoris. More preferably, the endogenous 
promoter is the promoter of Pichia pastoris glyceraldehyde-3-phosphate 
dehydrogenase (GAP). Alternatively, although not preferred, promoters of 
methanol responsive genes in methylotrophic yeast can also be used. Examples of 
such methanol responsive promoters include, but are not limited to, the promoter 
for the primary alcohol oxidase gene from Pichia pastoris AOX1, the promoter for 
the secondary alcohol oxidase gene from Pichia pastoris AOX2, the promoter for 
the dihydroxyacetone synthase gene from Pichia pastoris (DAS), the promoter for 
the P40 gene from Pichia pastoris, the promoter for the catalase gene from Pichia 
pastoris, and the like (see U.S. Patent No. 5,324,639). 

In another specific embodiment, the present invention provides a DNA 
encoding the chimeric protein which DNA further includes sequences allowing for 
its replication and selection in bacteria. In this way, large quantities of the DNA 
fragment can be produced by replication in bacteria. 

In a preferred embodiment, the present invention provides a DNA encoding 
the chimeric protein, wherein within the encoded chimeric protein, the first peptidyl 
fragment consists of the amino acid residues 1-50 of the B chain of native 
monellin, the second peptidyl fragment consists of the amino acid residue Gly and 
the third peptidyl fragment consists of the amino acid residues 1-45 of the A chain 
of native monellin, and said DNA further comprises the promoter of Pichia 
pastoris GAP and the signal sequence of Saccharomyces cerevisiae mating 
pheromone α-factor. 

In another preferred embodiment, the present invention provides a DNA 
encoding the chimeric protein, wherein the codons which are preferably used by 
Pichia pastoris cells are used. 
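
Which codons a given coding sequence actually uses can be tallied directly. The Python sketch below is an illustration only: it counts codons in the first twelve codons of FIG. 1, transcribed with the assumed scan corrections TIC to TTC and GAG to GAC at the codon labelled Asp. The function name `codon_counts` is hypothetical, and the sketch does not decide which codons Pichia pastoris prefers.

```python
from collections import Counter


def codon_counts(orf):
    """Count each codon in an in-frame coding sequence (spaces ignored)."""
    seq = orf.replace(" ", "").upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return Counter(seq[i:i + 3] for i in range(0, len(seq), 3))


# First row of FIG. 1 (codons 1-12), with assumed scan corrections.
fragment = "GGT GAG TGG GAG ATT ATT GAC ATT GGT CCA TTC ACT"

counts = codon_counts(fragment)
for codon, n in counts.most_common(3):
    print(codon, n)
```

Comparing such a tally against a reference table of host codon frequencies is one way to judge whether a synthetic gene follows the host's preferred usage.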
In a most preferred embodiment, the present invention provides a DNA 
encoding the chimeric protein wherein the DNA molecule comprises nucleotide 
sequence as depicted in Figure 1 or the DNA vector as depicted in Figure 4. 

The nucleic acid comprising a nucleotide sequence encoding the chimeric 
5 protein disclosed herein, or any fragments, analogues or derivatives thereof, can be 
obtained by any method(s) known in the art. The nucleic acid may be chemically 
synthesized entirely. Alternatively, the nucleic acid encoding each fragment of the 
chimeric protein, i.e., the first, second or third peptidyl fragment, may be obtained 
by molecular cloning or may be purified from the desired cells. The nucleic acid 
10 encoding each fragment of the chimeric protein may then be chemically or 

enzymatically ligated together to form the nucleic acid comprising a nucleotide 
sequence encoding the chimeric protein disclosed herein, or any fragments, 
analogues or derivatives thereof. 

Any Dioscoreophyllum cumminsii cell potentially can serve as the nucleic 
15 acid source for the isolation of the nucleic acids encoding monellin. Alternatively, 
the nucleic acids encoding monellin can be designed and synthesized according to 
the amino acid sequence of the native monellin depicted in Figure 1 (see also U.S. 
Patent No. 5,478,923). 

The DNA may be obtained by standard procedures known in the art from 
20 cloned DNA (e.g., a DNA "library"), by chemical synthesis, by cDNA cloning, or 
by the cloning of genomic DNA, or fragments thereof, purified from the desired 
cell (See, for example, Sambrook et al., 1989, Molecular Cloning, A Laboratory 
Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, IRL Press, 
25 Ltd., Oxford, U.K. Vol. I, II.) Clones derived from genomic DNA may contain 
regulatory and intron DNA regions in addition to coding regions; clones derived 
from cDNA will contain only exon sequences. Whatever the source, the gene 
should be molecularly cloned into a suitable vector for propagation of the gene. 

In the molecular cloning of the gene from cDNA, cDNA is generated from 
30 total cellular RNA or mRNA by methods that are well known in the art. The 
gene may also be obtained from genomic DNA, where DNA fragments are 
generated (e.g., using restriction enzymes or by mechanical shearing), some of 
which will encode the desired gene. The linear DNA fragments can then be 
separated according to size by standard techniques, including but not limited to, 
agarose and polyacrylamide gel electrophoresis and column chromatography. 

Once a nucleic acid comprising a nucleotide sequence encoding the chimeric 
5 protein disclosed herein, or any fragments, analogues or derivatives thereof, has 
been obtained, its identity can be confirmed by nucleic acid sequencing (by any 
method well known in the art) and comparison to the known sequences. DNA 
sequence analysis can be performed by any techniques known in the art, including 
but not limited to the method of Maxam and Gilbert (Maxam and Gilbert, 1980, 
10 Meth. Enzymol., 65:499-560), the Sanger dideoxy method (Sanger et al., 1977, 

Proc. Natl. Acad. Sci. U.S.A., 74:5463), the use of T7 DNA polymerase (Tabor and 
Richardson, U.S. Patent No. 4,795,699), use of an automated DNA sequenator 
(e.g., Applied Biosystems, Foster City, CA) or the method described in PCT 
Publication WO 97/15690. 
15 Nucleic acids which are hybridizable to a nucleic acid comprising a 

nucleotide sequence encoding the chimeric protein disclosed herein, or any 
fragments, analogues or derivatives thereof, can be isolated, by nucleic acid 
hybridization under conditions of low, high, or moderate stringency (See also Shilo 
and Weinberg, 1981, Proc. Natl. Acad. Sci. USA, 78:6789-6792). 
As used herein, "stable" means that a claimed single-chain monellin 
chimeric protein retains at least 70% of its sweet intensity after the protein has 
been placed at about 4°C for at least 6 months, or at about 60°C for at least 2.5 
hours, or at about 100°C for at least 5 minutes. In addition, "stable" means that a 
claimed single-chain monellin chimeric protein retains at least 70% of its sweet 
intensity after the protein has been placed at a pH ranging from about 2.0 to about 
11.0 for at least 6 hours. 
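
The 70% retention criterion in the definition above can be expressed as a simple ratio test. The following is a minimal sketch only: the function name and the numeric panel scores are hypothetical, and it assumes that sweet intensity has already been reduced to a single number before and after the treatment.

```python
def retains_sweetness(initial: float, after_treatment: float,
                      threshold: float = 0.70) -> bool:
    """True if the treated sample keeps at least `threshold` of its initial sweet intensity."""
    if initial <= 0:
        raise ValueError("initial sweet intensity must be positive")
    return after_treatment / initial >= threshold


# Hypothetical panel scores before and after a heat treatment.
print(retains_sweetness(10.0, 8.0))  # 80% of the initial intensity retained
print(retains_sweetness(10.0, 6.0))  # 60% of the initial intensity retained
```

Each of the listed conditions (temperature and time, or pH and time) would be tested separately with the same ratio.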

Sweetness of the claimed single-chain monellin chimeric protein can be 
assessed using an ordinary taste test that is known in the art. For example, 
comparison to the sweetness of sucrose can be made by suitable dilutions on a 
weight basis (see also U.S. Patent No. 5,478,923). 

The preferred codon usage by yeast cells can be determined by methods 
known in the art, e.g., methods disclosed in Sharp et al., Nucleic Acids Res., 1986, 
14(13):5125-43 and in Li and Luo, J. Theor. Biol., 1996, 181(2):111-24. 
According to Sharp, important characteristics of the preferred codons in yeast 
include a higher correlation with tRNA abundance, a greater degree of third base 
pyrimidine bias, and a lesser tendency toward A+T base pairs. Li and Luo disclose 
a method of classifying and predicting the gene expression level in E. coli and 
yeast cells which is called the Self-Consistent Information Clustering (SCIC). 
Using modified Codon Adaptation Index (CAI) values, Li and Luo carried out 
a linear regression analysis of the relation between base 
composition, base correlation and gene expression level in Escherichia coli and 
yeast. Li and Luo also proposed the assumption of an Expression-Enhancing-Network 
Site (EENS), the existence of which can be demonstrated by the linear equations 
between gene expression and base correlations in a codon, in adjacent codons and 
in non-adjacent codons. In addition, the codons that have been successfully used 
for expressing heterologous proteins in Pichia pastoris cells can be used. Examples 
of such codons can be found in U.S. Patent No. 4,837,148; U.S. Patent No. 
4,855,231; U.S. Patent No. 4,882,279; U.S. Patent No. 4,929,555; U.S. Patent No. 
5,122,465; U.S. Patent No. 5,324,639; Martinez-Ruiz et al., Protein Expr. Purif., 
1998, 12(3):315-22; Abdulaev et al., Protein Expr. Purif., 1997, 10(1):61-9; 
Kotake et al., J. Lipid Res., 1996, 37(3):599-605; Zhu et al., Arch. Biochem. 
Biophys., 1998, 352(1):1-8; Heim et al., Biochim. Biophys. Acta, 1998, 
1396(3):306-19; Ferrarese et al., FEBS Lett., 1998, 422(1):23-6; Jonsson et al., 
Curr. Genet., 1997, 32(6):425-30; Merkle et al., Biochim. Biophys. Acta, 1997, 
1336(2):132-46; Wuebbens et al., Biochemistry, 1997, 36(14):4327-36; Fierobe et 
al., Protein Expr. Purif., 1997, 9(2):159-70; Ferrari et al., FEBS Lett., 1997, 
401(1):73-7; Skory et al., Curr. Genet., 1996, 30(5):417-22; Sadhukhan et al., J. 
Biol. Chem., 1996, 271(31):18310-3; Tsujikawa et al., Yeast, 1996, 12(6):541-53; 
Ohi et al., Yeast, 1996, 12(1):31-40; Paifer et al., Yeast, 1994, 10(11):1415-9; 
Fidler et al., J. Mol. Endocrinol., 1998, 21(3):327-336; and Brocca et al., Protein 
Sci., 1998, 7(6):1415-22. 
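
One of the characteristics mentioned above, third base pyrimidine bias, can be illustrated by tallying the codons of a coding sequence. The sketch below is a plain codon count and is not the method of Sharp et al. or of Li and Luo; the function names are hypothetical and the short input sequence is used for illustration only.

```python
from collections import Counter


def codon_counts(dna: str) -> Counter:
    """Count codons in an in-frame coding sequence (whitespace is ignored)."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return Counter(seq[i:i + 3] for i in range(0, len(seq), 3))


def third_base_pyrimidine_fraction(dna: str) -> float:
    """Fraction of codons whose third base is C or T."""
    counts = codon_counts(dna)
    total = sum(counts.values())
    pyrimidine = sum(n for codon, n in counts.items() if codon[2] in "CT")
    return pyrimidine / total


example = "GGT GAG TGG GAG ATT ATT"  # six codons, illustrative only
print(codon_counts(example).most_common())
print(third_base_pyrimidine_fraction(example))
```

A tally of this kind over genes known to be well expressed in the host would be one way, among others, to see which synonymous codons that host tends to use.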
Whether a chimeric protein is capable of being immunoreactively bound by 
an anti-monellin or an anti-thaumatin antibody can be determined by methods 
known in the art. Examples of anti-monellin or anti-thaumatin antibodies 
that can be used in the present invention include, but are not limited to, the 
antibodies disclosed in Slootstra et al., Chem. Senses, 1995, 20(5):535-43; 
Antonenko and Zanetti, Life Sci., 1994, 55(15):1187-92; Bodani et al., Hybridoma, 
1993, 12(2):177-83; Mandal et al., Hybridoma, 1991, 10(4):459-66 and Haimovich, 
Isr. J. Med. Sci., 1975, 11(11):1183. 

5.2. PRODUCTION OF MONELLIN PROTEINS 
FROM RECOMBINANT PICHIA PASTORIS CELLS 

In a specific embodiment, the present invention provides a recombinant 
Pichia pastoris cell containing the nucleic acid which encodes a chimeric protein, 
said chimeric protein comprises, from N-terminus to C-terminus: a) a first peptidyl 
fragment consisting of an amino acid sequence that has at least 40% identity to 
residues 1-50 of the B chain of native monellin, in which the percentage identity is 
determined over an amino acid sequence of identical size to the B chain of native 
monellin; b) a peptidyl bond, or a second peptidyl fragment consisting of 1-12 
amino acids; and c) a third peptidyl fragment consisting of an amino acid sequence 
that has at least 40% identity to residues 1-45 of the A chain of native monellin, in 
which the percentage identity is determined over an amino acid sequence of 
identical size to the A chain of native monellin, wherein said chimeric protein is 
stable and a given amount of said chimeric protein is at least 100-fold sweet as 
compared to the identical amount of sucrose, and within said nucleic acid, codons 
which are preferably used by yeast cells are used. Preferably, the recombinant 
Pichia pastoris cell contains a DNA molecule comprising the nucleotide sequence as 
depicted in Figure 1 or a DNA vector as depicted in Figure 4. Recombinant 
Pichia pastoris cells containing the nucleic acids disclosed in Section 4.1. are also 
provided. 
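
The percentage identity recited above is determined over a sequence of identical size to the reference chain. A minimal sketch of such a comparison follows. It assumes an ungapped, position-by-position comparison of two equal-length sequences in one-letter code, which is only one possible reading of the definition; the function name and the 10-residue strings are hypothetical.

```python
def percent_identity(candidate: str, reference: str) -> float:
    """Percentage of positions at which two equal-length sequences carry the same residue."""
    if len(candidate) != len(reference):
        raise ValueError("sequences must be of identical size")
    if not reference:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(candidate, reference))
    return 100.0 * matches / len(reference)


reference = "GEWEIIDIGP"  # illustrative 10-residue stretch
variant = "GEWEAADIGP"    # same stretch with two substitutions
print(percent_identity(variant, reference))
```

Under this reading, a 50-residue first fragment would meet the 40% threshold if at least 20 of its positions match the reference.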

Methods for transforming methylotrophic yeast, such as Pichia pastoris, as 
well as methods applicable for culturing methylotrophic yeast cells containing in 
their genome a gene encoding a heterologous protein, are known generally in the 
art. Preferably, the transformation, positive transformant selection and culturing 
methods disclosed in U.S. Patent No. 4,837,148; U.S. Patent No. 4,855,231; U.S. 
Patent No. 4,882,279; U.S. Patent No. 4,929,555; U.S. Patent No. 5,122,465; and U.S. 
Patent No. 5,324,639 can be used in the present invention. 
In another specific embodiment, the present invention provides a process for 
producing a monellin chimeric protein comprising growing a recombinant Pichia 
pastoris cell containing the nucleic acid disclosed in Section 4.1. such that the 
encoded chimeric protein is expressed and secreted by the cell, and recovering the 
expressed and secreted chimeric protein. Preferably, a recombinant Pichia 
pastoris cell containing a DNA molecule comprising the nucleotide sequence as depicted 
in Figure 1 or a DNA vector as depicted in Figure 4 is used. 

Any suitable fermentation process in the art can be used in the present 
process. For large-scale production of recombinant DNA-based products driven by 
a GAP promoter in methylotrophic yeast such as Pichia pastoris, a three-stage, 
high cell-density fed batch fermentation system is preferably employed. In the first 
or growth stage of this fermentation system, the expression host Pichia pastoris 
cells are cultured in defined minimal medium such as BMGY (Buffered Minimal 
Glycerol-complex medium) with an excess of a non-inducing carbon source (e.g., 
glycerol). When the expression host Pichia pastoris cells are grown on such 
carbon sources, heterologous gene expression is repressed, which allows the 
generation of cell mass in the absence of heterologous protein expression. During 
this growth stage, it is also preferred that the pH of the medium be maintained at 
about 5. Next, the expression host Pichia pastoris cells are grown on a limited 
non-inducing carbon source for a short period of time to further increase the cell mass 
and to depress the glucose responsive promoter. The pH of the medium during this 
limited growth period is kept below 4, preferably in the range from about 2.0 to 
about 3.5. The final stage is the production stage wherein either the "glucose 
excess fed-batch mode" or the "mixed-feed fed-batch mode" can be used. In the 
"glucose excess fed-batch mode," 2% glucose alone is added. In the "mixed-feed 
fed-batch mode," a limiting amount of a non-inducing carbon source and glucose is 
added in the fermentor to induce the expression of the monellin gene driven by a 
GAP promoter. 

The secreted monellin chimeric proteins can be recovered from the Pichia 
pastoris culture medium by any methods known in the art. For example, methods 
disclosed in U.S. Patent Nos. 3,878,184 and 3,998,798; Morris and Cagan, 
Biochim. Biophys. Acta, 1972, 261:114-122; Kim et al., Protein Eng., 1989, 
2:571-575; and, more recently, Kondo et al., Nature Biotechnology, 1997, 15:453-457 can be 
used for recovering and isolating the secreted monellin chimeric proteins. 
Preferably, the expressed and secreted chimeric protein is recovered by a means 
comprising ion-exchange chromatography. More preferably, the expressed and 
secreted chimeric protein is recovered by a means comprising CM-Sephadex 
column chromatography or DEAE-Sephadex column chromatography. 

In another specific embodiment, the present invention provides the product 
of the above processes. 

6. EXAMPLE 

6.1. Preparation of the Synthetic Recombinant Monellin DNA 

The amino acid sequence of the recombinant monellin protein and the 
nucleotide sequence of the DNA encoding the recombinant monellin protein are 
shown in Figure 1. As shown in Figure 1, nucleotides 1-150 encode residues 1-50 
of the B chain of the native monellin protein; nucleotides 151-153 encode Glycine 
as the linking "L" portion; and nucleotides 154-288 encode residues 1-45 of the A 
chain of the native monellin protein. The recombinant monellin protein is preceded 
by the following amino acid sequence, which corresponds to a Met residue and the 
signal sequence of Saccharomyces cerevisiae mating pheromone α-factor: 
Met-Leu-Leu-Phe-Ile-Asn-Thr-Thr-Ile-Ala-Ser-Ile-Ala-Ala-Lys-Glu-Glu-Gly-Val-Ser-Leu- 
Glu-Lys-Arg-Glu-Ala-Glu-Ala-Glu-Phe (SEQ ID NO:3). 
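
The correspondence between residue numbers and nucleotide positions follows from the triplet code. The sketch below assumes 1-based numbering with the first codon at nucleotides 1-3, so that residue k is encoded by nucleotides 3k-2 to 3k; the function names are hypothetical.

```python
def codon_span(residue: int) -> tuple[int, int]:
    """1-based nucleotide positions (first, last) of the codon for a 1-based residue number."""
    if residue < 1:
        raise ValueError("residue numbers start at 1")
    return 3 * residue - 2, 3 * residue


def segment_span(first_residue: int, last_residue: int) -> tuple[int, int]:
    """Nucleotide span covering a run of consecutive residues."""
    return codon_span(first_residue)[0], codon_span(last_residue)[1]


print(segment_span(1, 50))   # B-chain portion
print(segment_span(51, 51))  # Gly linker
print(segment_span(52, 96))  # A-chain portion
```

Under this convention the 96-residue protein occupies nucleotides 1-288, and the stop codon shown in Figure 1 follows at 289-291.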

This synthetic DNA encoding the signal sequence of Saccharomyces 
cerevisiae mating pheromone α-factor and the recombinant monellin protein was 
prepared from the oligos M1-M4 and N1-N4, which were synthesized using the 
Applied Biosystems 380B DNA Synthesizer by ACTG company (see Figures 2-3 
and 5). The oligos were isolated by urea-polyacrylamide gel electrophoresis, 
purified by passing through a Sep-Pak C18 column (Whatman), and annealed and 
ligated as shown in Figure 3 to obtain the synthetic DNA bracketed by EcoRI sites. 
To synthesize the DNA encoding the signal sequence of Saccharomyces 
cerevisiae mating pheromone α-factor and the recombinant monellin protein, in a 100 
µl PCR reaction volume, 2 pM of each of the oligos M2 to N3 were mixed with 10 
pM M1 and N4 and heated to 94°C for 5 minutes in the absence of Taq DNA 
polymerase. The reaction mixture was then slowly cooled down to 37°C. After 1 
unit of the Vent DNA polymerase (New England Biolabs, Inc.) was added, the 
PCR reaction was performed according to the standard protocol. One hundred 
microliters of PCR reaction mixture contained 50 mM Tris-HCl (pH 8.0), 2.5 mM 
MgCl2, 10 mM DTT, 1 mM dNTP, and 1 unit of the Vent DNA polymerase. The 
PCR reaction was performed as follows: 94°C for 1 minute, 53°C for 1.5 
minutes, and 72°C for 2 minutes within each cycle, for a total of 30 cycles. 
Finally, the reaction mixture was incubated at 72°C for 10 minutes. The reaction 
mixture was extracted by phenol/chloroform, precipitated with ethanol, and gel 
purified in 1.2% low-melting agarose gel. The purified DNA fragment was 
inserted into the pT7Blue(R) vector (Novagen, Inc.) to generate the pT7yM 
plasmid (see Figure 5). In a 20 µl DNA ligation reaction, 2 µl of 10 mM ATP and 40 
units of the T4 DNA ligase (New England Biolabs, Inc.) were added and mixed with 
1 µg purified monellin DNA fragment and 50 ng pT7Blue(R) vector. The reaction 
mixture was kept at 16°C for 16 hours. The ligation mixture was transformed into 
host cells by adding 5 µl of the ligation mixture to 200 µl of E. coli NovaBlue 
competent cells (Messing, Methods in Enzymology, 1983, 101:20-78) and the 
desired sequence was confirmed by dideoxy sequencing using T7 and U19 primers 
(Sanger et al., Proc. Natl. Acad. Sci., 1977, 74:5463-5467). 
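
The cycling conditions given above (94°C for 1 minute, 53°C for 1.5 minutes and 72°C for 2 minutes, 30 cycles, followed by 10 minutes at 72°C) can be written down as a small program description. The sketch below only sums the programmed hold times; it assumes that ramp times between temperatures and the initial annealing step before addition of the polymerase are not counted, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float   # hold temperature in degrees Celsius
    minutes: float  # hold time in minutes


def program_minutes(cycle: list[Step], n_cycles: int, final: Step) -> float:
    """Total programmed hold time: the repeated cycle plus the final extension."""
    return n_cycles * sum(step.minutes for step in cycle) + final.minutes


ASSEMBLY_CYCLE = [Step(94, 1.0), Step(53, 1.5), Step(72, 2.0)]
FINAL_EXTENSION = Step(72, 10.0)

total = program_minutes(ASSEMBLY_CYCLE, 30, FINAL_EXTENSION)
print(f"{total:.0f} minutes of programmed hold time")
```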

6.2. Preparation of the Expression Vector pGWYS-1 

The pGAPZα expression vector was purchased from Invitrogen, Inc. The 
synthetic monellin DNA fragment was removed from pT7yMonellin with EcoRI 
and inserted into an EcoRI site of the pGAPZα vector to give pGWYS. Briefly, 5 
µg purified pT7yMonellin plasmid was digested in 20 µl reaction volume using 5 
units EcoRI (Promega Inc.) at 37°C for 2 hours. After the reaction mixture was 
separated by 1% low-melting agarose gel electrophoresis, the synthetic monellin 
DNA fragment was purified using the Wizard PCR Preps DNA purification kit 
(Promega, Inc). One hundred ng purified monellin DNA fragment were used for 
ligation into the expression pGAPZα vector. In the 10 µl ligation reaction, 50 ng 
of the EcoRI digested pGAPZα vector was mixed with 100 ng purified monellin 
DNA fragment. The ligation reaction was carried out in the presence of 10 µl of 
20 mM Tris-HCl (pH 7.5), 10 mM MgCl2, 10 mM DTT and 200 units of the T4 
DNA ligase (New England Biolabs, Inc.) at 16°C overnight to give the pGWYS. 
The ligation mixture was transformed into E. coli TOP10F' cells (Invitrogen, Inc.). 
Twenty Zeocin-resistant clones were picked and orientation of the insert was 
screened by PCR reaction using the α-factor (5'CTATTGCCAGCATTGCTGC3') 
(SEQ ID NO:4) and the N4 oligos. Each selected clone was transferred into 3 ml 
LB culture medium containing 200 µg/ml Zeocin and incubated with shaking at 
37°C overnight. The recombinant plasmid pGWYS was prepared from 1.5 ml 
cultured bacteria cells using the Qiagen Tip 20 kit system (Qiagen, Inc.). Fifty ng 
purified pGWYS plasmid was used as the PCR template to determine orientation of 
the insert. In a 25 µl PCR reaction, 50 ng of the pGWYS plasmid was mixed with 
2.5 pM of the α-factor and the N4 oligos in the presence of 1 unit Taq DNA 
polymerase (Promega, Inc.). The PCR reaction was performed under the following 
conditions: 94°C for 1 minute, 55°C for 1 minute, and 72°C for 2 minutes within 
each cycle, for a total of 40 cycles. Finally, the reaction mixture was 
incubated at 72°C for 10 minutes. After the PCR reaction mixture was separated 
on 1.2% agarose gel, one of the clones which contains the insert with the desired 
orientation was named pGWYS-1. The sequence of the insert was further 
confirmed by DNA sequencing. 
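
Because the insert is flanked by EcoRI sites on both sides, it can ligate in either orientation, and the PCR screen distinguishes the two: a vector-side forward primer and an insert-side reverse primer give a product only when the insert points the right way. The sketch below illustrates that logic in silico. It assumes exact primer matching on a linear top-strand sequence; the toy insert, the flanking vector bases and the stand-in for the N4 oligo are hypothetical, and only the α-factor primer sequence is taken from the text.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def amplicon_length(template: str, forward: str, reverse: str):
    """Product length if the primers face each other on the template, else None."""
    start = template.find(forward)                     # forward primer matches the top strand
    site = template.find(reverse_complement(reverse))  # reverse primer anneals opposite this site
    if start == -1 or site == -1 or site < start:
        return None
    return site + len(reverse) - start


ALPHA_FACTOR_PRIMER = "CTATTGCCAGCATTGCTGC"  # SEQ ID NO:4, from the text
# Hypothetical toy insert: EcoRI site, a few codons, a stop codon, EcoRI site.
INSERT = "GAATTC" + "GGTGAGTGGGAGATT" + "CCACCACCATAA" + "GAATTC"
REVERSE_PRIMER = reverse_complement(INSERT[-18:])  # stands in for the N4 oligo


def toy_plasmid(insert: str) -> str:
    """Hypothetical vector context around the cloning site."""
    return "ATGAGA" + ALPHA_FACTOR_PRIMER + "AAAGAG" + insert + "TTTGTA"


desired = amplicon_length(toy_plasmid(INSERT), ALPHA_FACTOR_PRIMER, REVERSE_PRIMER)
flipped = amplicon_length(toy_plasmid(reverse_complement(INSERT)),
                          ALPHA_FACTOR_PRIMER, REVERSE_PRIMER)
print(desired, flipped)
```

In the toy example the correctly oriented insert gives a product of defined length, while the flipped insert gives none, which mirrors the presence or absence of a band on the agarose gel.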

6.3. Transformation of Pichia pastoris Cells with the pGWYS-1 

To generate the high-level and stable expression of monellin in Pichia 
pastoris, purified pGWYS-1 plasmid was transformed into Pichia pastoris cells by 
the electroporation technique described in the Pichia pastoris Expression Kit Manual 
using the Electroporation Apparatus II (Invitrogen Inc.). Briefly, 500 ml of the 
Pichia pastoris GS115 cells were grown in YPD medium at 30°C to an OD600 of 
1.3. Cells were pelleted by centrifugation at 1,500 g for 5 minutes at 4°C. 
Pelleted cells were washed with 500 ml of ice-cold sterile water. The washing step 
was repeated with 250 ml and 20 ml ice-cold sterile water, respectively. Then, the 
cells were washed with 20 ml of ice-cold 1 M sorbitol and resuspended in 1 ml 
ice-cold sorbitol. Forty µl of the yeast GS115 cells in 1 M sorbitol were mixed with 
10 µg purified pGWYS-1 plasmid to a total volume of 50 µl and the mixture was 
transferred into an ice-cold cuvette. The cuvette containing the mixture was 
incubated on ice for 5 minutes. Electroporation was performed according to the 
parameters in the manufacturer's Electroporation Apparatus II manual (Invitrogen Inc.). 
After the electrical pulse, 1 ml of ice-cold 1 M sorbitol was added into the cuvette, 
and the content of the cuvette was transferred into a microcentrifuge tube. Two 
hundred µl transformed cells were plated on one 5 RDB plate containing 400 µg/ml 
Zeocin. The plates were incubated at 30°C until colonies appeared. Positive 
transformants were characterized by their growth in the presence of Zeocin at 
various concentrations, e.g., 400 µg/ml, 600 µg/ml, 800 µg/ml and 1000 µg/ml. 

6.4. Stability Test of the Positive Transformants 

Three positive transformants were selected for further characterization based 
on their growth in the presence of 800 µg/ml Zeocin and the expression of 
recombinant monellin by the 2% glucose induction. The following experiment was 
performed to test their genetic stability. Each of these 3 positive transformants was 
picked up using a sterile toothpick and incubated on a YPD plate without any 
selection at 30°C until colonies appeared. The colonies were picked up and plated 
on a new YPD plate until new colonies appeared. After such non-selective growth 
was repeated 50 times, each of the passage colonies was incubated on a selective 
plate containing 800 µg/ml Zeocin. The protein expression upon 2% glucose 
induction was analyzed by SDS-PAGE. All three positive transformants showed 
the same phenotype as the original colonies after 50 passages on the YPD 
plates. 

6.5 Production of the Recombinant Monellin Protein 

Each of the three positive transformants selected in Section 6.4 was grown in 1 liter 
YPD medium in a 5 liter flask at 30°C with vigorous shaking (250 rpm). Two ml 
supernatant were obtained from the culture after 24 hours, 48 hours and 72 hours, 
respectively. Five µl of the samples collected at each time point were analyzed 
using the 15-20% gradient polyacrylamide gel. The secreted recombinant monellin 
protein was observed as the 12 kD protein band. Quantitation of the SDS-PAGE 
analysis using the Densitometer (Molecular Dynamics, Inc.) indicated that one of the 
positive strains produced nearly 10 grams per liter secreted recombinant single-chain 
monellin protein. This strain was named GWySl. 

6.6 Purification of Secreted Recombinant Monellin Protein 

Two methods were used to purify the recombinant monellin protein 
secreted from GwyS-1 yeast strain. According to the first method, after 72-hour 
culturing, supernatant was collected by centrifugation at 12,000 rpm (17,000 g). 
After collection, the supernatant pH was adjusted to about 6.8 using 0.1 N NaOH 
solution. One M NaH2PO4-Na2HPO4 (pH 6.8) was added into the supernatant to 
1:100 (v/v) and mixed well. The supernatant was then loaded on the CM-Sephadex 
column (Pharmacia, Inc.) pre-equilibrated with 0.01 M NaH2PO4-Na2HPO4 (pH 6.8) 
solution. After the column was washed with 5 column volumes of 0.01 M 
NaH2PO4-Na2HPO4 (pH 6.8) solution, the recombinant monellin protein was eluted with 0.3 M 
NaCl-0.01 M NaH2PO4-Na2HPO4 (pH 6.8) solution. After dialysis against water, the 
purity of the protein was determined to be about 98% by gel electrophoresis. 
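
Adding the 1 M phosphate stock to the supernatant at 1:100 (v/v) brings the added phosphate to roughly the 0.01 M of the column equilibration buffer. The arithmetic is sketched below under two possible readings of "1:100", both of which are assumptions; phosphate already present in the culture supernatant and the volume of NaOH added are ignored, and the function name is hypothetical.

```python
from fractions import Fraction


def diluted_molarity(stock_molar: int, stock_parts: int, total_parts: int) -> Fraction:
    """Concentration after diluting `stock_parts` of stock into `total_parts` of final volume."""
    return Fraction(stock_molar) * Fraction(stock_parts, total_parts)


# Reading "1:100 (v/v)" as 1 part stock in 100 parts total volume (an assumption).
in_total = diluted_molarity(1, 1, 100)
# Reading it as 1 part stock added to 100 parts supernatant (also an assumption).
added_to = diluted_molarity(1, 1, 101)

print(float(in_total), round(float(added_to), 4))
```

Either reading gives approximately 0.01 M, so the sample is loaded at about the same phosphate concentration as the pre-equilibrated CM-Sephadex column.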

According to the second method, after 72-hour culturing, supernatant was 
collected by centrifugation at 12,000 rpm (17,000 g). After collection, the 
supernatant pH was adjusted to about 7.2 using 0.1 N NaOH solution. One M 
NaCl-1 M NaH2PO4-Na2HPO4 (pH 7.2) was added into the supernatant to 1:100 
(v/v) and mixed well. The supernatant was then loaded on the DEAE-Sephadex 
column (Pharmacia, Inc.) pre-equilibrated with 1 M NaH2PO4-Na2HPO4 (pH 7.2)-1 M 
NaCl solution. The flow-through fraction was collected and dialyzed against water. 
The purity of the protein was determined to be about 98% by gel electrophoresis. 
The recombinant monellin protein purified according to either method was 
further lyophilized to dry powder for testing its sweetness. 

6.7. Sweetness and Stability Test 

Sweetness of the purified recombinant monellin protein was assessed using 
an ordinary taste test. Comparison to the sweetness of sucrose was made by 
suitable dilutions on a weight basis. In a typical test, 1, 10, 25 and 50 mg/ml 
aqueous sucrose solutions were used as standard solutions. The minimum weight of 
the purified recombinant monellin protein which could generate sweet taste was 
compared with that of sucrose. The recombinant monellin of this invention 
requires the addition of an amount which is about 1000-fold less than that of 
sucrose. For example, 50 ng/ml recombinant monellin protein solution was as 
sweet as 50 mg/ml sucrose (Lucky Supermarket's Lady Lee brand sugar). 
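
On a weight basis, the fold difference in sweetness is the ratio of the two equally sweet concentrations once both are expressed in the same unit. The sketch below performs that unit conversion and division; the function and table names are hypothetical. With a 50 mg/ml sucrose standard, a 1000-fold ratio corresponds to 50 µg/ml of protein, whereas the 50 ng/ml figure as printed in the example corresponds to a ratio of 1,000,000.

```python
from fractions import Fraction

# Conversion factors to ng/ml ("ug" stands for micrograms).
NG_PER_ML = {"ng/ml": 1, "ug/ml": 1_000, "mg/ml": 1_000_000}


def fold_sweeter(sucrose: tuple[int, str], protein: tuple[int, str]) -> Fraction:
    """Ratio of equally sweet concentrations, sucrose over protein, on a weight basis."""
    sucrose_ng = sucrose[0] * NG_PER_ML[sucrose[1]]
    protein_ng = protein[0] * NG_PER_ML[protein[1]]
    return Fraction(sucrose_ng, protein_ng)


print(fold_sweeter((50, "mg/ml"), (50, "ug/ml")))
print(fold_sweeter((50, "mg/ml"), (50, "ng/ml")))
```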

Stability was measured by dissolving natural monellin (Sigma, Inc.) and the 
purified recombinant monellin protein at 100 µg/ml concentration at pH 2.0, 4.0, 
6.3, and 7.5. Each sample was heated to 37°C, 50°C, 60°C, 70°C, 80°C, 90°C and 
100°C for 15 minutes and let cool to room temperature before tasting. The most 
dramatic difference was that natural monellin lost its sweetness when heated to 
50°C at pH 2.0, while the purified recombinant monellin protein retained its 
sweetness even after heating at 100°C for 5 minutes. 

The present invention is not to be limited in scope by the microorganism 
deposited or the specific embodiments described herein. Indeed, various 
modifications of the invention in addition to those described herein will become 
apparent to those skilled in the art from the foregoing description and 
accompanying figures. Such modifications are intended to fall within the scope of 
the appended claims. 

Various references are cited herein, the disclosures of which are 
incorporated by reference in their entireties. 
WHAT IS CLAIMED IS: 



1. An isolated nucleic acid comprising a nucleotide sequence 
encoding a chimeric protein, said chimeric protein comprises, from N-terminus to 
C-terminus: 

a) a first peptidyl fragment consisting of an amino acid sequence that 
has at least 40% identity to residues 1-50 of the B chain of native 
monellin, in which the percentage identity is determined over an 
amino acid sequence of identical size to the B chain of native 
monellin; 

b) a peptidyl bond, or a second peptidyl fragment consisting of 1-12 
amino acids; and 

c) a third peptidyl fragment consisting of an amino acid sequence that 
has at least 40% identity to residues 1-45 of the A chain of native 
monellin, in which the percentage identity is determined over an 
amino acid sequence of identical size to the A chain of native 
monellin, 

wherein said chimeric protein is stable and a given amount of said chimeric protein 
is at least 100-fold sweet as compared to the identical amount of sucrose, and 
within said nucleic acid, codons which are preferably used by yeast cells are used. 

2. The isolated nucleic acid of claim 1, wherein the first 
peptidyl fragment consists of an amino acid sequence that has at least 60% identity 
to the B chain of native monellin. 

3. The isolated nucleic acid of claim 1, wherein the first 
peptidyl fragment consists of an amino acid sequence that has at least 90% identity 
to the B chain of native monellin. 

4. The isolated nucleic acid of claim 1, wherein the first 
peptidyl fragment consists of the amino acid residues 1-50 of the B chain of native 
monellin depicted as the amino acid residues 1-50 in Figure 1 (SEQ ID NO:5). 

5. The isolated nucleic acid of claim 1, wherein the second 
peptidyl fragment consists of the amino acid residue Gly. 
6. The isolated nucleic acid of claim 1, wherein the second 
peptidyl fragment consists of the amino acid sequence Gly-Gly-Gly-Ser (SEQ ID 
NO:2). 

7. The isolated nucleic acid of claim 1, wherein the second 
peptidyl fragment consists of the amino acid sequence Gly-Gly-Gly-Ser-Gly-Gly- 
Gly-Ser-Gly-Gly-Gly-Ser (SEQ ID NO:1). 

8. The isolated nucleic acid of claim 1, wherein the third 
peptidyl fragment consists of an amino acid sequence that has at least 60% identity 
to the A chain of native monellin. 

9. The isolated nucleic acid of claim 1, wherein the third 
peptidyl fragment consists of an amino acid sequence that has at least 90% identity 
to the A chain of native monellin. 

10. The isolated nucleic acid of claim 1, wherein the third 
peptidyl fragment consists of the amino acid residues 1-45 of the A chain of native 
monellin depicted as the amino acid residues 52-96 in Figure 1 (SEQ ID NO:5). 

11. The isolated nucleic acid of claim 1 which nucleic acid 
encodes the amino acid residues 1-96 of Figure 1 (SEQ ID NO:5). 

12. The isolated nucleic acid of claim 1, wherein the chimeric 
protein is capable of being immunoreactively bound by an anti-monellin antibody. 

13. The isolated nucleic acid of claim 1, wherein the chimeric 
protein is capable of being immunoreactively bound by an anti-thaumatin antibody. 

14. The isolated nucleic acid of claim 1, wherein the chimeric 
protein further comprises an amino acid sequence which is capable of directing 
secretion of said chimeric protein from Pichia pastoris. 

15. The isolated nucleic acid of claim 14, wherein the 
secretion-directing sequence is an endogenous signal sequence of Pichia pastoris. 

16. The isolated nucleic acid of claim 15, wherein the 
endogenous signal sequence is selected from the group consisting of the signal 
sequence of Pichia pastoris acid phosphatase, Pichia pastoris aspartic proteinase 
and Pichia pastoris carboxypeptidase Y encoded by Pichia pastoris PRC1. 

17. The isolated nucleic acid of claim 14, wherein the 
secretion-directing sequence is a yeast signal sequence, wherein said yeast is not Pichia 
pastoris. 

18. The isolated nucleic acid of claim 17, wherein the yeast 
signal sequence is a signal sequence from Saccharomyces cerevisiae. 

19. The isolated nucleic acid of claim 18, wherein the 
Saccharomyces cerevisiae signal sequence is selected from the group consisting of 
the signal sequence of Saccharomyces cerevisiae SUC 2 and Saccharomyces 
cerevisiae mating pheromone α-factor. 

20. The isolated nucleic acid of claim 19, wherein the 
Saccharomyces cerevisiae signal sequence is the signal sequence of Saccharomyces 
cerevisiae mating pheromone α-factor. 

21. The isolated nucleic acid of claim 11, further comprising an 
amino acid sequence which is capable of directing secretion of said chimeric 
protein from Pichia pastoris. 

22. The isolated nucleic acid of claim 21, wherein the 
secretion-directing sequence is the signal sequence of Saccharomyces cerevisiae mating 
pheromone α-factor. 

23. The isolated nucleic acid of claim 14, wherein the 
secretion-directing sequence is selected from the group consisting of the signal sequence of 
Aspergillus giganteus alpha-Sarcin, alpha-N-Acetylgalactosaminidase, OmpA 
protein, the mouse alpha-factor (cCell), the pepper endo-beta-1,4-glucanases, the 
laccase isolated from the ligninolytic fungus Trametes, murine lysosomal acid 
alpha-mannosidase, the porcine inhibitor of carbonic anhydrase, Aspergillus 
awamori glucoamylase, mouse major urinary protein, pho1, rabbit 
angiotensin-converting enzyme (ACE), and the bacterial thermostable alpha 
amylase. 

24. The nucleic acid of claim 1, wherein said nucleic acid is a 
DNA. 

25. An isolated nucleic acid comprising a nucleotide sequence 
complementary to the nucleotide sequence of claim 1. 

26. An isolated nucleic acid hybridizable to the DNA sequence of 
claim 24. 

27. The DNA of claim 24, further comprising a promoter which 
is capable of directing protein expression in Pichia pastoris. 

28. The DNA of claim 27, wherein the promoter is an 
endogenous promoter of Pichia pastoris. 

29. The DNA of claim 28, wherein the endogenous promoter is 
the promoter of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase. 

30. The DNA of claim 24, wherein said DNA encodes the amino acid 
residues 1-96 of Figure 1 (SEQ ID NO:5) and said DNA further comprises the 
promoter of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase and the 
signal sequence of Saccharomyces cerevisiae mating pheromone α-factor. 

31. The DNA of claim 24, wherein the codons which are 
preferably used by Pichia pastoris cells are used. 

32. A DNA molecule comprising the nucleotide sequence as depicted 
in Figure 1. 

33. The pGWYS1 DNA vector as depicted in Figure 4. 

34. A recombinant Pichia pastoris cell containing the nucleic 
acid of claim 1. 

35. A recombinant Pichia pastoris cell containing the DNA of 
claim 32. 

36. A recombinant Pichia pastoris cell containing the DNA of 
claim 33. 

37. A process for producing a chimeric protein comprising 
growing a recombinant Pichia pastoris cell containing the nucleic acid of claim 1 
such that the encoded chimeric protein is expressed and secreted by the cell, and 
recovering the expressed and secreted chimeric protein. 

38. A process for producing a chimeric protein comprising 
growing a recombinant Pichia pastoris cell containing the DNA of claim 32 such 
that the encoded chimeric protein is expressed and secreted by the cell, and 
recovering the expressed and secreted chimeric protein. 




39. A process for producing a chimeric protein comprising 
growing a recombinant Pichia pastoris cell containing the DNA of claim 33 such 
that the encoded chimeric protein is expressed and secreted by the cell, and 
recovering the expressed and secreted chimeric protein. 

40. The process of claim 37, wherein the expressed and secreted 
chimeric protein is recovered by a means comprising ion-exchange 
chromatography. 

41. The process of claim 40, wherein the ion-exchange 
chromatography being used is CM-Sephadex column chromatography. 

42. The process of claim 40, wherein the ion-exchange 
chromatography being used is DEAE-Sephadex column chromatography. 

43. The product of the process of claim 37. 

44. The product of the process of claim 38. 

45. The product of the process of claim 39. 

46. The product of the process of claim 40. 

47. The product of the process of claim 41. 

48. The product of the process of claim 42. 

49. A chimeric protein, said chimeric protein comprises, from 
N-terminus to C-terminus: 

a) a first peptidyl fragment consisting of an amino acid sequence that has at 
least 40% identity to residues 1-50 of the B chain of native monellin, in which the 
percentage identity is determined over an amino acid sequence of identical size to 
the B chain of native monellin; 

b) a peptidyl bond, or a second peptidyl fragment consisting of 1-12 amino 
acids; and 

c) a third peptidyl fragment consisting of an amino acid sequence that has at 
least 40% identity to residues 1-45 of the A chain of native monellin, in which the 
percentage identity is determined over an amino acid sequence of identical size to 
the A chain of native monellin, 

wherein said chimeric protein is stable and a given amount of said chimeric protein 
is at least 100-fold sweet as compared to the identical amount of sucrose, and 
within said nucleic acid, codons which are preferably used by yeast cells are used. 
GGT GAG TGG GAG ATT ATT GAC ATT GGT CCA TTC ACT
Gly Gly Trp Glu Ile Ile Asp Ile Gly Pro Phe Thr

CAA AAC TTG GGT AAG TTC GCT GTT GAC GAG GAG AAC
Gln Asn Leu Gly Lys Phe Ala Val Asp Glu Glu Asn

AAG ATT GGT CAA TAC GGT AGA TTG ACT TTC AAC AAG
Lys Ile Gly Gln Tyr Gly Arg Leu Thr Phe Asn Lys

GTT ATT AGA CCA TGT ATG AAG AAG ACT ATT TAC GAG
Val Ile Arg Pro Cys Met Lys Lys Thr Ile Tyr Glu

AAC GAG GGT TCT AGA GAG ATT AAG GGT TAC GAG TAC
Asn Glu Gly Ser Arg Glu Ile Lys Gly Tyr Glu Tyr

CAA TTG TAC GTT TAC GCT TCT GAC AAG TTG TTC CGT
Gln Leu Tyr Val Tyr Ala Ser Asp Lys Leu Phe Arg

GCT GAC ATT TCT GAG GAC TAC AAG ACT CGT GGT CGT
Ala Asp Ile Ser Glu Asp Tyr Lys Thr Arg Gly Arg

AAG TTG TTG AGA TTC AAC GGT CCA GTT CCA CCA CCA
Lys Leu Leu Arg Phe Asn Gly Pro Val Pro Pro Pro

TAA
Stop

FIG. 1
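Claim 49 refers to codons preferably used by yeast cells. As a minimal sketch, the code below tallies the codons of the coding sequence as it is printed under SEQ ID NO:6 in the sequence listing. It only counts codons; which codons count as preferred is not stated in this section, so no preference table is assumed. The names `CODING_SEQUENCE` and `split_codons` are illustrative.

```python
from collections import Counter

# Coding sequence copied from the SEQ ID NO:6 entry (291 nt, including the stop codon).
CODING_SEQUENCE = (
    "ggtgagtggg agattattga cattggtcca ttcactcaaa acttgggtaa gttcgctgtt "
    "gacgaggaga acaagattgg tcaatacggt agattgactt tcaacaaggt tattagacca "
    "tgtatgaaga agactattta cgagaacgag ggttctagag agattaaggg ttacgagtac "
    "caattgtacg tttacgcttc tgacaagttg ttccgtgctg acatttctga ggactacaag "
    "actcgtggtc gtaagttgtt gagattcaac ggtccagttc caccaccata a"
).replace(" ", "")


def split_codons(sequence: str) -> list[str]:
    """Split an in-frame coding sequence into consecutive three-letter codons."""
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return [sequence[i:i + 3] for i in range(0, len(sequence), 3)]


codons = split_codons(CODING_SEQUENCE)
codon_usage = Counter(codons)

if __name__ == "__main__":
    # Print each codon with its count, most frequent first.
    for codon, count in codon_usage.most_common():
        print(codon.upper(), count)
```

In this sequence every lysine codon is AAG and AAA does not occur, which is the kind of pattern a codon tally makes visible.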
M1

5' AGA ATT CGG TGA GTG GGA GAT TAT TGA CAT TGG TCC ATT
CAC TCA AAA CTT GG 3'

M2

5' GAA CAA GAT TGG TCA ATA CGG TAG ATT GAC TTT CAA CAA
GTT TAT TAG GCC ATG T 3'

M3

5' GAG ACC GAG GGT TCT AGA GAG ATT AAG GGT TAC GAG TAC
CAA TTG TAC GTT TAC GCT TC 3'

M4

5' GTG CTG ACA TTC CTG AGG ACT ACA AGA CTC GTG GTC GTA
AGT TGT TGA GAT TC 3'

N1

5' GTA TTG ACC AAT CTT GTT CTC CTC GTC AAC AGC GAA CTT
ACC CAA GTT TTG AGT GAA TG 3'

N2

5' CTC TAG AAC CCT CGT TCT CGT AAA TAG TCT TCT TCA TAC
ATG GTC TAA TAA CCT TG 3'

N3

5' GTC CTC AGA AAT GTC AGC ACG GAA CAA CTT GTC AGA AGC
GTA AAC GTA CAA TTG 3'

N4

5' AGA ATT CTT ATG GTG GTG GAA CTG GAC CGT TGA ATC TCA
ACA ACT TAC GAC 3'

FIG. 2
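The M and N oligos lie on opposite strands, so neighbouring oligos can be checked for complementary ends. The sketch below takes the M1 and N1 sequences as printed in the sequence listing (SEQ ID NOs: 7 and 11), reverse-complements N1, and reports how many bases at the 3' end of M1 match the start of that reverse complement. The helper names are illustrative, and the sketch makes no claim about the annealing or amplification conditions actually used.

```python
# Oligo sequences copied from the SEQ ID NO:7 (M1) and SEQ ID NO:11 (N1) entries.
M1 = "agaattcggt gagtgggaga ttattgacat tggtccattc actcaaaact tgg".replace(" ", "")
N1 = "gtattgacca atcttgttct cctcgtcaac agcgaactta cccaagtttt gagtgaatg".replace(" ", "")

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a lowercase DNA sequence."""
    return sequence.translate(_COMPLEMENT)[::-1]


def longest_overlap(left: str, right: str) -> int:
    """Length of the longest suffix of `left` that is also a prefix of `right`."""
    for size in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:size]):
            return size
    return 0


if __name__ == "__main__":
    n1_top_strand = reverse_complement(N1)
    size = longest_overlap(M1, n1_top_strand)
    print("M1/N1 complementary overlap:", size, "nt")
    print("overlapping bases:", M1[-size:])
```

The same two helpers can be applied to the other adjacent pairs (N1/M2, M2/N2, and so on) by swapping in the corresponding sequences.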
[Schematic: oligos M1, M2, M3 and M4 drawn above oligos N1, N2, N3 and N4, with an EcoRI label.]

FIG. 3

SUBSTITUTE SHEET (RULE 26)
Comments for pGWYS1 (3479 bp)

GAP Promoter region: 1-483
Alpha-factor signal sequence: 493-760
Monellin coding region: 762-1059
3' AOX1 termination region: 1060-1306
TEF1 Promoter region: 1307-1709
EM7 Promoter region: 1710-1781
Sh ble ORF: 1782-2518
CYC1 termination region: 2159-2477
ColE1 origin (pUC-derived): 2478-3479



FIG. 4 



[Flow chart of plasmid construction. Labels, in reading order: PCR amplification with mixed M and N oligos; pT7 cloning vector; ligation; recombinant single chain monellin gene; EcoRI digestion; monellin gene EcoRI fragment; plasmid map labels GAP, EcoRI, GAP; EcoRI digestion; ligation; pGAPZα; pGWYS1.]

FIG. 5
Lane 1. Protein MW Marker

Lane 2. 5 µl Culture Medium

Lane 3. Partially Purified Recombinant Single Chain Monellin

Lane 4. 40 µg Native Monellin

FIG. 6
Recombinant Yeast Clone

20 ml Culture
    Culture Time: 48 hours    Medium: YPD
    Temperature: 30 °C        Shaking Speed: 150 rpm

1 Liter Culture
    Culture Time: 24 hours    Medium: YPD
    Temperature: 30 °C        Shaking Speed: 150 rpm

20 Liter Culture
    Culture Time: 12 hours    Medium: YPD
    Temperature: 28 - 30 °C
    Shaking Speed: 300 rpm    pH 6.4

400 Liter Culture
    Culture Time: 12 hours    Medium: M5
    Temperature: 28 - 30 °C
    Shaking Speed: 300 rpm    pH 6.4

Recovery Supernatant By
Centrifugation
12,000 g, 10 minutes

FIG. 7a
Recovery Culture Supernatant
By Centrifugation
12,000 g, 10 minutes



Method 1 



Method 2 



Adjust Supernatant pH to 6.8 
Add 1/100 Vol 1 M NaCl



Adjust Supernatant pH to 7.2 
Add 1/100 Vol
1 M NaH2PO4-Na2HPO4



Load to CM-Sephadex column 



Elute Protein Sample With 
0.01 M NaH2PO4-Na2HPO4
(pH 6.8) 0.3 N NaCl



Load to DEAE-Cellulose 
DE52 column 



Collecting Protein Sample 



Concentrating Protein Sample 
0.3KD -0.6KD MW Cut Off 



FIG. 7b 
SEQUENCE LISTING

<110> GENWAY BIOTECH, INC. 
Duan, Lingxun 

<120> PRODUCTION OF RECOMBINANT MONELLIN USING
METHYLOTROPHIC YEAST EXPRESSION SYSTEM 



<130> 46433-20002.00 

<140> To be Assigned
<141> 

<150> US 60/114,529 
<151> 1998-12-31 

<160> 14

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptidyl fragment of the chimeric protein
<400> 1 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
1 5 10 

<210> 2 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptidyl fragment of the chimeric protein 
<400> 2 

Gly Gly Gly Ser 
1 

<210> 3 
<211> 30 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 3 

Met Leu Leu Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu
1 5 10 15

Glu Gly Val Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe
20 25 30



<210> 4 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> alpha-factor 
<400> 4 

ctattgccag cattgctgc 

<210> 5 
<211> 96 
<212> PRT 

<213> Pichia pastoris 

<400> 5

Gly Gly Trp Glu Ile Ile Asp Ile Gly Pro Phe Thr Gln Asn Leu Gly
1 5 10 15

Lys Phe Ala Val Asp Glu Glu Asn Lys Ile Gly Gln Tyr Gly Arg Leu
20 25 30

Thr Phe Asn Lys Val Ile Arg Pro Cys Met Lys Lys Thr Ile Tyr Glu
35 40 45

Asn Glu Gly Ser Arg Glu Ile Lys Gly Tyr Glu Tyr Gln Leu Tyr Val
50 55 60

Tyr Ala Ser Asp Lys Leu Phe Arg Ala Asp Ile Ser Glu Asp Tyr Lys
65 70 75 80

Thr Arg Gly Arg Lys Leu Leu Arg Phe Asn Gly Pro Val Pro Pro Pro
85 90 95


<210> 6 
<211> 291 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligos used for synthesis of the recombinant 
single-chain monellin protein 

<400> 6

ggtgagtggg agattattga cattggtcca ttcactcaaa acttgggtaa gttcgctgtt 60
gacgaggaga acaagattgg tcaatacggt agattgactt tcaacaaggt tattagacca 120
tgtatgaaga agactattta cgagaacgag ggttctagag agattaaggg ttacgagtac 180
caattgtacg tttacgcttc tgacaagttg ttccgtgctg acatttctga ggactacaag 240
actcgtggtc gtaagttgtt gagattcaac ggtccagttc caccaccata a 291

<210> 7
<211> 53
<212> DNA

<213> Artificial Sequence
<220>

<223> Oligonucleotide in location M1 in the synthesized
monellin DNA

<400> 7

agaattcggt gagtgggaga ttattgacat tggtccattc actcaaaact tgg

<210> 8 

<211> 55 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide in location M2 in the synthesized
monellin DNA

<400> 8 

gaacaagatt ggtcaatacg gtagattgac tttcaacaag tttattaggc catgt 

<210> 9 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide in location M3 in the synthesized
monellin DNA

<400> 9 

gagaccgagg gttctagaga gattaagggt tacgagtacc aattgtacgt ttacgcttc 

<210> 10 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide in location M4 
in the synthesized monellin DNA 

<400> 10 

gtgctgacat tcctgaggac tacaagactc gtggtcgtaa gttgttgaga ttc 

<210> 11 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide in location N1 in the synthesized
monellin DNA 

<400> 11 

gtattgacca atcttgttct cctcgtcaac agcgaactta cccaagtttt gagtgaatg 

<210> 12 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide in location N2 in the synthesized 
monellin DNA 
<400> 12

ctctagaacc ctcgttctcg taaatagtct tcttcataca tggtctaata accttg 56 

<210> 13 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide in location N3 in the synthesized 
monellin DNA 

<400> 13 

gtcctcagaa atgtcagcac ggaacaactt gtcagaagcg taaacgtaca attg 54 

<210> 14 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide in location N4 in the synthesized 
monellin DNA 

<400> 14 

agaattctta tggtggtgga actggaccgt tgaatctcaa caacttacga c 51 